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SUMMARY 


This  Memorandum  describes  work  done  in  mapping  image 
processing  algorithms  on  to  transputer  arrays.  Object 
classification  based  on  the  autocorrelation  function  has  been 
applied  to  infrared  images.  Using  the  OCCAM  programming 
language,  parallel  processing  techniques  have  been  developed 
to  make  use  of  the  concurrency  available  for  communication 
and  calculation.  The  results  for  this  specific  problem  are 
assessed  and  conclusions  drawn  on  the  use  of  transputers  and 
OCCAM  for  parallel  processing. 
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1  INTRODUCTION 

Previous  work  (1]  has  shovn  that  a  simple  measure  of  the  shape 
of  the  autocorrelation  function  is  a  useful  discriminant  for  target 
classification  in  infra-red  images.  It  has  been  found  that  it  is  possible 
to  recognise  vehicles  from  texture  information  and,  because  object  shape 
information  is  not  used,  it  is  veil  suited  to  thermal  imagers  vhich  may 
not  resolve  shape  or  edge  details.  The  algorithms  in  this  paper  have  been 
adapted  and  developed  for  evaluation  on  arrays  of  transputers.  Much 
experience  has  been  gained  in  the  design  and  development  of  algorithms 
and  the  approach  required  for  efficient  use  of  transputers. 

The  problem  has  been  approached  as  follovs  : 

i)  Distribute  the  image  under  consideration  across  a  netvork 
of  transputers 

ii)  Reduce  noise  in  the  image 

iii)  Threshold  the  image  to  distinguish  bright  (hot)  areas  of 
potential  interest 

iv)  Label  connected  regions  and  extract  size  and  position 
parameters 

v)  Calculate  the  autocovariance 

vi)  Detect  targets  and  display  and  file  the  results 

At  each  stage  care  vas  taken  to  make  as  much  use  as  possible  of  the 
concurrency  available,  and  not  to  rely  on  the  sequential  algorithms 
already  tested  in  (1].  One  of  the  aspects  of  parallel  processing 
investigated  vas  the  difficulties  encountered  in  trying  to  adapt 
sequential  algorithms  for  efficient  execution  on  a  parallel  machine.  In 
many  cases  a  completely  nev  approach  is  needed,  and  a  simple 
transformation  of  the  old  code  proves  impossible.  Conversely  there  are 
occasions  vhen  inherent  parallelism  in  the  problem  can  give  excellent 
results  vith  only  minor  changes  to  the  sequential  algorithm. 
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ARCHITECTURE 


The  Reconfigurable  Transputer  Processor  (RTP)  being  developed  in 
ESPRIT  project  P1085  uses  hierarchically  clustered  T800  transputers 
connected  through  a  VLSI  switch.  The  T800  [2]  is  an  advanced  variant  of 
the  transputer  developed  for  high  speed  manipulation  of  floating  point 
nuabers.  The  transputer  is  a  single  chip,  powerful  Microprocessor 
designed  as  a  module  for  use  in  the  construction  of  multi-processor 
systems.  The  T800  incorporates  a  fast  32-bit  integer  processor  and  a 
floating  point  64-bit  processor,  4  kilobytes  of  fast  static  RAM  and  four 
high-speed  communication  links.  Each  RTP  node  (  "Supernode"  )  consists  of 
16  T800  worker  transputers  each  with  256  Kbytes  of  external  memory,  a 
switch  controller  transputer,  and  an  additional  transputer  with  a  large 
amount  of  memory  (16  Mbytes)  which  can  be  used  for  storing  and 
distributing  data  and  code.  All  the  transputer  links  are  connected  to  a 
pair  of  VLSI  switches,  in  such  a  way  that  a  program  running  on  the  switch 
controlling  transputer  can  set  up  any  4-connected  network  required.  A 
single  node  can  be  used  as  a  powerful  workstation  with  a  performance  in 
excess  of  16  Megaflops,  or  nodes  may  be  connected  together,  again 
reconf igurably,  to  form  a  machine  with  up  to  64  nodes. 

3  TRANSPUTER  ARRAY  CONFIGURATION 

The  work  was  carried  out  on  a  prototype  RTP  node  configured  as 
shown  in  figure  1.  T414  transputers  (  32-bit  CPU  and  no  floating  point 
hardware)  were  used,  and  some  facilities  of  the  RTP  machine,  such  as  the 
command  bus  (  mainly  intended  to  aid  diagnostic  work)  were  not  available. 

The  16  'worker'  transputers  are  connected  in  a  4  x  4  grid.  Each 
processor  works  on  a  square  patch  representing  1/16  of  the  original 
image.  The  picture  under  consideration  is  sent  from  a  file  held  on  the 
disc  of  the  IBM  PC  via  the  svitch  controller  (SC)  to  processor  13  in  the 
array.  From  there  it  is  passed  across  and  up  the  array  (  and  up  to  a  B007 
transputer  graphics  board  for  visual  display).  It  is  intended  in  future 
to  provide  digitised  images  from  a  TV  camera  via  a  framestore  controlled 
by  a  further  transputer.  The  spare  transputer  links  fro*  processors  P13 
to  P16  will  be  used  to  ease  the  bottleneck  in  distributing  the  picture 
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data  to  the  array.  It  is  clear  from  our  results  that  this  will  lead  to 
■uch  enhanced  over  all  throughput. 


figure  1 


4  OCCAM 

Although  transputer  compilers  for  other  languages  (e.g.  Prolog, 
C,  Fortran)  exist,  OCCAM  vas  chosen  for  programming  this  problem. 
OCCAM  [3]  is  the  first  language  to  be  based  on  the  concept  of  parallel, 
in  addition  to  sequential,  execution,  and  to  provide  automatic 
communication  and  synchronisation  betveen  concurrent  processes.  OCCAM 
enables  an  application  to  be  described  as  a  collection  of  "processes", 
each  executing  concurrently,  and  able  to  communicate  vith  other  processes 
via  "channels".  The  program  is  thus  structured  as  modules,  and  the 
channels  provide  the  connections  betveen  the  modules.  OCCAM  thereby 
allovs  a  program  to  be  vritten  in  a  clearly  defined  and  structured 
manner,  vhich  exploits  the  parallelism  available  in  a  transputer  network. 
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LOW  LEVEL  IMAGE  PROCESSING 


Several  low-level  operations  are  performed  on  the  original  image 
in  order  to  derive  a  nev  image  in  which  connected  areas  of  a  higher  pixel 
intensity  ("brightness")  are  distinguished  from  the  surrounding  areas  of 
lover  pixel  intensity  ("background"). 

a)  CONVOLUTION 

A  convolution  operation  using  a  3  x  3  weight  matrix  is  first  applied  for 
'noise'  reduction.  This  smooths  out  isolated  bright  and  dark  pixels.  The 
value  of  a  pixel  p  at  (i,j)  is  replaced  by  q  where 

(  M  l*1  ) 

q(i,j)  «  <  £  £  p(m,n)  )  /  8 

l.  m-i-1  n-j-1  J 

It  is  clear  that  to  perform  this  operation  on  pixels  at  the  edge  of  a 
local  image  (i.e.  at  the  edge  of  one  of  the  distributed  patches)  pixel 
data  is  required  from  neighbouring  processors.  In  each  processor  ar  edge 
overlap  is  performed  to  extend  the  local  picture  by  1  pixel  all  round. 
First  the  horizontal  and  vertical  edges  are  swapped  and  then  the  4  corner 
pixels.  For  example  in  processor  6:  first  the  edges  are  exchanged  with 
the  neighbouring  processors  2,  7,  10  and  5,  and  then  the  corner  pixels 
are  exchanged  with  processors  1,  3,  11  and  9,  via  2,  7,  10  and  5.  The 

design  of  the  transputer  allows  communication  via  any  link  to  occur 
concurrently  with  communications  on  all  other  links,  and  so  copying  a 
small  section  of  a  neighbour's  data  is  a  very  small  overhead  in  overall 
processing  time. 

b)  THRESHOLDING 

The  nev  filtered  image  is  thresholded  to  produce  a  binary  image.  Several 
different  approaches  have  been  used  to  set  the  threshold  : 

1)  a  purely  local  threshold  value  based  on  a  local  (within  the 
patch)  maximum,  minimum  and  mean  (average)  of  pixel  intensities. 
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ii)  a  global  aean  ,  in  vhich  case  local  values  are  sent  to  the 
controller  and  a  global  maximum,  minimum  and  Bean  calculated.  This  gives 
poor  results  in  pictures  vhere  areas  are  only  'bright'  in  a  local 
context. 

iii)  a  aore  'graded'  local  threshold  vhere  local  aean  values  for 
neighbouring  processors  are  used  in  the  calculation.  The  threshold  value 
for  any  pixel  depends  on  its  relative  position  in  the  subiaage  i.e.  its 
distance  fron  any  of  the  boundaries.  This  involves  exchanging  data 
between  processors. 

Hethod  ii)  vas  discarded  as  it  gave  poor  results  when  the 
background  across  the  whole  iaage  vas  not  consistently  darker  than  the 
possible  targets.  Hethod  iii)  vas  the  aost  effective  but  took  50Z  longer 
to  calculate,  as  it  involved  data  swapping  and  a  aore  complicated 
algorithm.  In  most  cases,  aethod  i)  vould  be  adequate. 

In  each  processor  a  binary  image  is  created  containing  l's 
vhere  there  are  pixels  above  the  threshold  level  in  the  filtered  picture 
and  0's  elsewhere.  It  is  now  assumed  that  the  pixels  in  any  possible 
areas  of  interest  in  the  original  iaage  have  been  selected.  The  only 
pixels  vhich  will  be  considered  are  those  marked  by  a  1  in  the  binary 
image. 


6  TARGET  DISCRIMINATION  AGAINST  BACKGROUND 

'Targets'  are  connected  areas  of  pixels  vhich  have  been  marked 
by  a  1  in  the  binary  iaage.  The  aark  is  changed  to  a  target  marker  value. 
In  the  binary  iaage,  the  neighbours  of  a  point  found  to  be  in  a  target 
are  checked  for  membership  of  the  same  target.  When  a  nev  point  is  added 
to  a  target,  the  parameter  values  for  that  target  (area,  x  and  y  minimum 
and  maximum)  are  updated.  If  it  is  found  that  one  target  is  'touching' 
another  i.e.  adjacent  pixels  have  different  Barker  values,  an  entry  is 
made  in  a  merger  table.  Vhen  all  the  pixels  in  the  binary  iaage  have  been 
scanned,  and  all  the  l's  have  been  replaced  by  aarkers,  the  merger  table 
is  used  to  re-aark  each  target  vith  a  consistent  Barker  value  and  to 
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■erge  the  parameter  values.  At  this  stage  each  processor  contains  a  list 
of  'targets'  and  a  picture  Marked  to  shov  the  pixels  in  each  target.  Only 
the  actual  points  contained  in  each  target  are  needed  for  the  calculation 
of  the  autocorrelation  function.  This  can  lead  to  complications  when  the 
local  target  contains  points  along  the  edge  of  the  local  picture.  Care 
aust  be  taken  not  to  introduce  discontinuities  across  processor 
boundaries  which  would  introduce  errors  into  the  calculations. 


7  AUTOCORRELATION 

The  one-dimensional  autocorrelation  function  d(x)  of  x(t), 
referred  to  a  displacement  x  is  defined  by 


♦(t) 


n 

x(t).x(t+T)  dt 

_O0 


For  a  discretely  sampled  x,  the  autocorrelation  coefficient  for  a 
displacement  r  is  estimated  as 

Rr  .  R(r)  «  ♦(rdx)  -  1  (Xf)  -  xM*n+r  -  x) 

N-t 

vhere  N  is  the  number  of  samples  used  and  x  the  mean  value  of  x  (which 
here  corresponds  to  image  intensity)  within  the  marked  target. 

So  that 

^  -  2  2 

Rn  *  1  E  (x  -  x)  «  S  «  variance  of  x 

v  «  n 

N  n*l 

If  Rf  is  normalised  we  obtain  the  autocovariance  •  Rj/Rq  • 

Then 


-1  <  Cr  <  +1  and  CQ  -  1 

The  shape  parameters  and  Sg  approximate  the  magnitude  and  sense  of 
curvature  of  the  autocovariance  near  its  peak,  on  two  different  scales  of 
distance. 
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51  '  (C1  "  C2)/(C0  "  cl> 

52  *  (C2  ’  C4>/(C0  •  C2) 

and  ve  have  used  the  values  of  Cj  and  and  S2  to  classify  targets. 
Correlation  functions  in  the  x-direction  only  are  used.  This  is  for 
simplicity  and  in  order  to  prove  the  concepts  rather  than  to  make  a  fully 
automatic  classifier. 


8  AUTOCOV ARI ANCE  CALCULATION 


The  equation  for  Cr  requires  that  the  mean  value  x  vithin  each 
separate  target  needs  to  be  calculated  before  accumulating  sums  of 
products.  To  avoid  scanning  the  image  tvice  ,  first  to  find  the  mean  and 
secondly  to  calculate  Cf  for  each  target,  the  equation  can  be  rewritten 


Cr 


1. 

\ 


JVC 


X  .X 

n  n+r 


- 


x  ) 

n+r' 


;2 


x-r 


/  R, 


where 


Ny  «  number  of  lines  in  target  subimage 

Nx  ■  number  of  pixels  on  line  vithin  subimage 

«  x  -  x  .  +1 

max  min 

r  -  0,1, 2, 3, 4 


In  practice,  there  nay  be  gaps  along  a  line  between  pixels  which  belong 

to  the  same  target,  in  which  case  each  set  of  connected  points  is  treated 

as  a  separate  line,  local  values  for  x  and  x  ,  are  used,  and  N  is 

I8X  Bin  y 

amended . 


If  any  of  the  targets  found  and  marked  in  the  binary  image  is 
completely  contained  vithin  its  own  processor,  and  does  not  include  any 
points  on  the  boundary  of  the  local  image,  the  autocovariance  calculation 
can  nov  be  computed.  In  the  tables  giving  timing  details  these  targets 
are  referred  to  as  ’local  targets'.  Tvo  different  approaches  to  the 
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problem  of  targets  which  overlap  adjacent  processors  ('global  targets') 
have  been  tested,  and  a  third  considered* 

i)  The  target  parameters  and  the  marked  picture  are  sent  up 
the  array  from  each  processor  and  collected  by  the  transputer  in  the  IBM 
PC.  Touching  targets  are  combined,  and  for  each  target  the 
autocorrelation  function  can  be  calculated  using  the  original  picture 
stored  on  the  IBM  PC  disc  and  an  amalgamated  marked  picture. 

ii)  Partial  sums  of  pixel  values  ,  products  and  number  of  valid 
pixels  are  amassed  locally  and  sent  to  the  IBM  PC  for  collation  (or  any 
'controller'  transputer  could  be  used).  To  allow  values  of  r  up  to  4, 
where  r  is  the  distance  between  pixels  in  a  sum  or  product  in  the 
equation,  extra  overlaps  of  original  picture  and  marked  target 
information  are  required  from  neighbouring  transputers. 

iii)  A  strategy  of  switching  the  machine  connectivity  was 
considered,  in  which  processing  is  done  as  above  up  to  the  target 
discrimination  stage,  and  then  the  transputer  array  is  reconfigured  to 
cope  vith  the  ACF  calculation  to  make  more  efficient  use  of  all  the 
processors.  Unfortunately  it  is  not  possible  to  reconfigure  the 
transputer  network  during  program  execution  without  making  use  of  the 
control  bus,  which  was  not  available  on  the  prototype  machine.  It  is 
planned  to  examine  the  feasibility  of  this  approach  on  the  ESPRIT  RTP 
node  when  it  becomes  available. 


9  RESULTS  SUMMARY 

The  variants  of  the  above  algorithms  were  programmed  and  the 
results  compared.  The  figures  for  timings  were  obtained  using  many 
methods  for  attaining  faster  performance,  in  particular 

a)  an  OCCAM  compiler  which  optimises  the  use  of  the  fast  local 

memory 

b)  optimising  techniques  available  in  the  OCCAM  language 

c)  as  much  use  of  concurrency  as  possible,  in  particular  with 
regard  to  overlapping  calculations  and  communications. 
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The  following  execution  tines  were  achieved  over  the  16  transputers  for 
convolution  and  thresholding  (timings  in  seconds) 


picture 

local 

graded 

Processor 

overlap 

convolution 

threshold 

threshold 

PI 

0.0019 

0.4344 

0.1292 

0.1827 

P2 

0.003 

0.4364 

0.1293 

0.1787 

P3 

0.0033 

0.4364 

0.1291 

0.179 

P4 

0.0026 

0.4364 

0.1293 

0.1789 

P5 

0.0026 

0.4364 

0.1292 

0.1791 

P6 

0.0029 

0.4364 

0.1292 

0.1791 

P7 

0.0031 

0.4364 

0.1292 

0.1791 

P8 

0.0025 

0.4364 

0.1295 

0.1788 

P9 

0.0026 

0.4364 

0.1295 

0.1782 

P10 

0.0029 

0.4364 

0.1295 

0.1782 

Pll 

0.0029 

0.4364 

0.1295 

0.1786 

P12 

0.0023 

0.4364 

0.1296 

0.1784 

P13 

0.0019 

0.4364 

0.1295 

0.1817 

P14 

0.0022 

0.4364 

0.1297 

0.1778 

P15 

0.0022 

0.4364 

0.1296 

0.178 

P16 

0.0022 

0.4364 

0.1294 

0.1784 

It  is  easy  to  see  that  apart  from  the  overlap  function,  where 
position  in  the  transputer  network  is  important  (for  instance  PI  has  only 
2  edges  to  swap),  the  low-level  operations  are  data-independent  and 
timing  differences  are  caused  by  the  positioning  of  variables  in  memory 
by  the  compiler  or  programmer.  It  is  obviously  important  to  make  as  much 
use  as  possible  of  the  fast  local  memory. 


For  the  autocovariance  calculation,  timings  were  obtained  for 
both  methods  i)  and  ii)  of  Section  8  for  targets  which  overlap  processor 
boundaries. 

i)  Timings  for  the  transputers  in  the  network  when  target 
merging  and  calculations  are  done  on  the  IBM  PC  (in  seconds) 
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Processor 

target 

calculate 

total 

discrimination 

local  ACF 

run  time 

PI 

0.4055 

0.0 

4.4127 

P2 

0.4255 

0.0 

4.4009 

P3 

0.5176 

0.0 

3.4486 

P4 

0.4611 

0.1062 

3.449 

P5 

0.3964 

0.0178 

4.3851 

P6 

0.3567 

0.0 

4.3599 

P7 

0.5514 

0.0083 

3.3889 

P8 

0.2952 

0.0529 

3.3892 

P9 

0.3178 

0.0 

4.4259 

P10 

0.3494 

0.0 

3.3467 

Pll 

0.2079 

0.0 

3.3396 

P12 

0.2649 

0.0 

3.3366 

P13 

0.3681 

0.0 

5.7917 

P14 

0.2649 

0.0 

4.425 

P15 

0.2191 

0.0164 

3.4375 

P16 

0.8854 

0.098 

3.4378 

and 

timings  on  the 

IBM  PC  transputer 

are  : 

global  amalgamation  and  calculation  :  1.4365s 


total  run  time  (including  6.015s  to  read  file)  :  16.2131s 

ii)  Timings  in  the  network  when  partial  calculations  are  done 


locally  and  amalgamated  on  the  IBM  PC  transputer 

Processor  target 

global  ACF 

total 

discrimination 

calculation 

run  time 

PI 

0.3975 

0.1852 

2.4842 

P2 

0.424 

0.1202 

2.4824 

P3 

0.5151 

0.1731 

2.4824 

P4 

0.4535 

0.0 

2.4827 

P5 

0.3952 

0.1619 

2.427 

P6 

0.3571 

0.1496 

2.4262 

P7 

0.5478 

0.14 

2.4265 

P8 

0.2948 

0.0 

2.4268 

P9 

0.3165 

0.0049 

2.3887 

P10 

0.3481 

0.0259 

2.251 

Pll 

0.2085 

0.0125 

2.2511 

P12 

0.2646 

0.0184 

2.2512 

P13 

0.3617 

0.0008 

2.4326 

P14 

0.2644 

0.0065 

2.3879 

P15 

0.2195 

0.0 

2.2585 

P16 

0.8747 

0.0 

2.2588 

with  timings  on  the 

IBM  PC  transputer 

: 

time  to 

receive  partial  ACF  elements 

:  0.1921s 

time  to 

collate  partial  sums 

:  0.0591s 

total  run  time  (including  6.0089s  to  read  file) 

:  12.4522s 
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It  can  easily  be  seen  that  method  ii),  even  though  it  involves 
extra  data  movement,  is  auch  faster  (ignoring  the  file  reading  time, 
processing  takes  6.443s  rather  than  10.19Bs). 

10  CONCLUSIONS 

Many  conclusions  can  be  dravn  from  the  results  of  the  various 
aethods  and  techniques  employed  to  calculate  autocorrelation  functions 
using  a  netvork  of  transputers.  The  problem  has  two  distinct  components  - 
the  lov-level  image  processing,  followed  by  the  feature  extraction  and 
calculation  phase.  It  is  clear  that  during  the  lov-level  stage  all  the 
transputers  in  the  netvork  are  doing  the  same  operations  at  the  same  time 
and  so  there  are  no  problems  of  efficiency,  or  need  for  load  balancing. 
In  fact  this  is  true  of  any  problem  vhich  can  be  decomposed  in  a 
geometric  fashion  into  identical  sub-problems  working  on  a  subset  of  the 
data  with  only  a  minimum  need  for  inter-processor  communication. 

The  second  phase  is  much  more  data-dependent  and  so  it  is  more 
difficult  to  design  a  problem-solving  strategy  vhich  uses  all  the 
transputers  as  efficiently  as  possible.  At  first  it  was  felt  that  the 
quasi-dynamic  switching  approach  vould  be  the  most  efficient.  In  this 
method,  once  target  areas  have  been  located  in  the  subimages,  the  feature 
extraction  and  ACF  calculation  vould  be  farmed  out  over  a  new  netvork. 
Hovever  this  could  involve  a  large  amount  of  data  redistribution,  not 
only  of  the  original  picture  but  also  of  the  marked  target  areas.  It  is 
soon  found  that  the  addition  of  large  amounts  of  data  passing  is  both 
time  consuming  (at  run  time)  and  also  presents  many  opportunities  for 
introducing  deadlock,  a  situation  in  vhich  tvo  or  more  processes  are 
perpetually  vaiting  for  communication  from  each  other. 

The  communications  handling  soon  proved  itself  to  be  the  more 
complicated  task  from  the  programmer's  point  of  view,  while  the  coding  of 
the  numerical  algorithms  vas  relatively  easy  (particularly  with  the 
checking  mechanisms  available  in  the  OCCAM  compiler).  Any  solution  using 
parallel  processing  techniques  needs  to  take  into  account  hov  much 
programming  time  vill  be  required,  not  only  for  designing  and  coding  but 
also  for  testing  and  debugging.  The  debugging  tools  available  at  present 
are  rather  limited,  although  more  aids  are  being  developed.  There  is 
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often  a  heavy  penalty  to  pay  (in  teras  of  programmer  time)  for 
sophisticated  algorithms  which  produce  enhanced  performance.  It  is  often 
extremely  difficult  to  make  even  minor  modifications,  and  in  some  cases  a 
small  change  in  specification  can  lead  to  a  major  redesign  of  the 
program.  The  importance  of  the  algorithm  design  phase,  in  particular  the 
choice  of  parallel  decomposition  of  the  problem,  cannot  be 
over-emphasised.  A  simplicity  of  concept  is  of  paramount  importance. 
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